Speaker recognition using a trajectory-based segmental HMM
نویسندگان
چکیده
A segmental HMM is a HMM whose states are associated with sequences of acoustic feature vectors (or segments), rather than individual vectors. By treating segments as homogeneous units it is possible, for example, to develop better models of speech dynamics. This paper begins by describing a type of segmental HMM in which the relationship between the state and acoustic level descriptions of a speech signal is regulated by an intermediate, articulatory layer, and discusses its potential benefits for speaker recognition. As a first step towards applying this type of model to speaker recognition, text-dependent speaker verification results obtained on YOHO using a simpler segmental HMM are presented, which show a 44% reduction in false acceptances using the segmental model compared with a conventional HMM. Experiments in text-independent speaker verification on Switchboard are then described.
منابع مشابه
Speaker adaptation of trajectory HMMs using feature-space MLLR
Recently, a trajectory model, derived from the hidden Markov model (HMM) by imposing explicit relationships between static and dynamic features, has been proposed. The derived model, named trajectory HMM, can alleviate two limitations of the HMM: constant statistics within a state and conditional independence assumption of state output probabilities. In the present paper, a speaker adaptation a...
متن کاملSpeech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMM
This paper describes how non-linear formant trajectories, based on ‘trajectory HMM’ proposed by Tokuda et al., can be exploited under the framework of multiple-level segmental HMMs. In the resultant model, named a non-linear/linear multiple-level segmental HMM, speech dynamics are modeled as non-linear smooth trajectories in the formant-based intermediate layer. These formant trajectories are m...
متن کاملElimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model
In this paper, a study of topology of Hidden Markov Model (HMM) used in speech recognition is addressed. Our main contribution is the introduction of the notion of trajectory folding phenomenon of HMM. In complex phonetic contexts and in speaker-variability, this phenomenon degrades the discriminability of HMM. The goal of this paper is to give some explanation and experimental evidence suggest...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملA Recognition Method Using Synthesis-based Scoring That Incorporates Direct Relations between Static and Dynamic Feature Vector Time Series
It is well known that hidden Markov models (HMMs) can only exploit the time-dependence in the speech process in a limited way. Parametric trajectory models have been proposed to exploit this time-dependency. However, parametric trajectory modeling methods are unable to take advantage of efficient HMM training and recognition methods. This paper describes a new speech recognition technique that ...
متن کامل